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Abstract 

In this work we consider dimensional improvements of the logarithmic Sobolev, Talagrand and Brascamp- 
Lieb inequalities. For this we use optimal transport methods and the Borell-Brascamp-Lieb inequality. 
These refinements can be written as a deficit in the classical inequalities. They have the right scale with 
respect to the dimension. They lead to sharpened concentration properties as well as refined contrac¬ 
tion bounds, convergence to equilibrium and short time behavior for the laws of solutions to stochastic 
differential equations. 
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Introduction 

We shall be concerned with diverse ways of measuring and bounding the distance between probability 
measures, and the links between them. We will focus on three main inequalities that we now describe. 

• A probability measure /.i on R” satisfies a logarithmic Sobolev inequality (in short LSI) with constant 
R > 0 (see [4] for instance) if for all probability measures v in R”, absolutely continuous with respect 
to /i, 

(!) 

Here H and I are the relative entropy and the Fisher information, defined for / = by 

H{v\n) = Entq/) = j f log fdfj, and I{v\n) = J (2) 

For / we assume that V/// G L 2 {v). 

• A probability measure in R” satisfies a Talagrand transportation inequality [37] with constant 
R > 0 if for all v absolutely continuous with respect to [i 

W%(v,n) < (3) 
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Here W 2 is the Monge-Kantorovich-Wasserstein distance; it is defined for p and v in P 2 (lR rl ) by 

W 2 (fi,v) = inf (JJ\v-x\>M*,y) 

where n runs over the set of (coupling) measures on IR” x R” with respective marginals p and v. 
We let P 2 (R") be the space of probability measures p on R” with finite second moment, that is, 
f |a;| 2 dp(*) < +00 (see [1], [39]). 

By the Otto-Villani Theorem [36], the logarithmic Sobolev inequality (1) implies the Talagrand 
inequality (3) with the same constant (see also [7], [39, Chap. 22]). 

• Let p be a probability measure in R" with density e~ l where V is a C 2 and strictly convex function. 
Then the Brascamp-Lieb inequality asserts that for all smooth functions /, 

Var M (/) < J V/ • Hess(H) _ 1 V/dp. (4) 

Here Var Al (/) = / / 2 dp — (/ /dp ) 2 is the variance of / under the measure p, see [4, Sect 4.9.1] for 
instance. 


The standard Gaussian measure 7 in R” with density e~ v for V(x) = |a;| 2 /2 + nlog(27r)/2, satisfies 
the three inequalities (1), (3) with R = 1 and (4). In fact, in the Gaussian case, the Brascamp-Lieb 
inequality (4) can be obtained from (1) by linearization, namely by taking v = /p with / close to 1. 
Let us note that in this case Hess(V) = Id„, the Brascamp-Lieb inequality becomes exactly the Poincare 
inequality. Moreover these inequalities are optimal for the Gaussian measure: by direct computation, 
equality holds in ( 1 ) and (3) for translations of 7 , that is, for measures v = exp (a-x— 4 -)7 with a £ R”; 
equality holds in (4) for f(x) = b ■ x, b £ I" (see [4, Chap. 4 and 5]). 


Inequalities (1), (3) and (4) share the significant property of tensorisation, leading to possible constants R 
independent of the dimension of the space. In other words, if a probability measure p satisfies one of these 
three inequalities with constant R > 0, then for any IV £ N*, the product measure p w = ® N ^ satisfies 
the same inequality with the same constant R. This can be interesting in applications to problems set in 
large or infinite dimensions. 

However, for regularity or integrability arguments, one may need more precise forms capturing the pre¬ 
cise dependence on the dimension. Such dimension dependent improvements have been observed in the 
Gaussian case. Namely, the dimensional improvement 

H (y\ 7) < ^ J \x\ 2 dv — | + ^log (l + “( 7 W7) + n ~ J \ x \ 2du )) ( 5 ) 

of the logarithmic Sobolev inequality (1) has been obtained by D. Bakry and M. Ledoux [5] by self- 
improvement from the Euclidean logarithmic Sobolev inequality, or by semigroup arguments on the Eu¬ 
clidean heat semigroup (see also [4, Sect. 6.7.1] and the early work [15] by E. Carlen). The dimensional 
improvement 

/N 2 d, + u-2nexp(/M_d7-i-^H7)) (6) 

of the Talagrand inequality (3) has been derived in [3] ; the argument is based on local hypercontractivity 
techniques on an associated Hamilton-Jacobi semigroup and fine properties of the heat semigroup. It has 
further been observed in [5] that linearizing (5) leads to the dimensional improvement 

Var 7 (/) < J | V /| 2 — ^ ( J (M 2 - n)/d 7 ) (7) 


of the Brascamp-Lieb (or Poincare) inequality (4) for the Gaussian measure (see also [4, Sect. 6.7.1]). On 
the other hand, by a spectral analysis of the Ornstein-Uhlenbeck semigroup, the bound 


Var 7 (/)< - / |V /| 2 d 7 + - 


V/d 7 


( 8 ) 
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has been established in [30, Sect. 6.2]. By the Cauchy-Schwarz inequality, it improves upon (4). Naturally, 
both inequalities (7) and (8) are optimal, and equality holds for f(x) = a ■ x; equality also holds for 
f(x) = |a:| 2 , in fact for the first two Hermite polynomials. The above proofs of (5), (6) and (8) are very 
specific to the Gaussian case and can not be extended to other measures. 

These dimensional improvements can also be written as a deficit in the classical non dimensional ver¬ 
sions (1), (3), (4) of the inequalities: namely, for the logarithmic Sobolev (LSI in short) and Talagrand 
(Tal in short) inequalities, lower bounds on the quantities 

6lsi(v\ii) := - RH(v\n) and 6 T al{v\v) ■= H(u\p) - ^ W 2 2 (i/, /z). 

The problem of dimensional refinements of standard functional inequalities has been recently considered 
in an intensive manner. Via the development of refined optimal transportation tools, beautiful results for 
the Gaussian isoperimetric inequality were obtained by Figalli-Maggi-Pratelli [25] (see also R. Eldan [20] 
or [23] for convex cones). Further recent results have been established on deficit in the logarithmic Sobolev 
inequality in the Gaussian case by Figalli-Maggi-Pratelli [26], Indrei-Marcon [32] and Bobkov & al [8]. In 
particular [8] rediscovers (5) and extends earlier results obtained in dimension one by Barthe-Kolesnikov [6] 
on the Talagrand deficit. Fathi-Indrei-Ledoux [22] also considers these deficits, particularly emphasizing 
the case where v has additional properties, such as a Poincare inequality ensuring a better constant in 
the logarithmic Sobolev inequality. Very recently D. Cordero-Erausquin [17] has studied refinements of 
the Talagrand and Brascamp-Lieb inequalities via optimal transport tools. 

Let us also quote C. Villani [39, p. 605]: 

There is no well-identified analog of Talagrand inequalities that would take advantage of the 
finiteness of the dimension to provide sharper concentration inequalities 

as a motivation to investigate further the problem. As we will see there are other striking applications of 
these dimensional refinements than sole concentration. 

Finally recall that the so-called Bakry-Emery criterion (or r 2 -criterion) ensures that the measure p, with 
density e _v satisfies the logarithmic Sobolev inequality (1) and Talagrand inequality (3) as soon as the 
potential V satisfies Hess(V) > i?Id„ with R > 0, as symmetric matrices. One of the goals of this paper 
is to extend the above dimensional inequalities under this condition with R > 0 or only Hess(V) > 0. 
For this we shall use multiple tools and we will compare our inequalities with other recent extensions. 
Applications to concentration inequalities and short and long time behaviour for the laws of solutions to 
stochastic differential equations are also given. 


Plan of the paper and main results 

Let /i be a probability measure on R n with density e -V where V is C 2 . 

In Section 1, we propose a method based on the Borell-Brascamp-Lieb inequality to get dimensional 
logarithmic Sobolev inequalities in the spirit of the works [9, 11] by S. Bobkov and M. Ledoux. The 
method is based on a general convexity inequality given in Theorem 1.1. For instance, in Corollary 1.4 
we shall prove the following : If Hess( V) > R Id„ with R > 0, then 


Ent M (/ 2 ) < n(s 


1 - log s ) + 


1 

2 R 


(1 — s) W + ^ S ~J~ 


f 2 dv 


(9) 


for any s > 0 and any function / such that / / 2 d/i = 1. This improves upon the classical logarithmic 
Sobolev inequality (1) under the Bakry-Emery condition, which is recovered for s = 1. 

In Section 2 (Theorem 2.1) we propose a dimensional Talagrand inequality through optimal transportation 
in the spirit of Barthe-Kolesnikov [6] and D. Cordero-Erausquin [16] or the recent [17] : If Hess(V) > i?Id n 
with R > 0 then 


R 


W -2 (/a, v) < v(V) — fa(V) + n — nexp — V) — nfV) — H( 


( 10 ) 


for all v £ P 2 (R"). This bound implies the classical Talagrand inequality (3). Let us observe that, using 
the terminology of the Ta-condition, the associated Markov generator L = A — W • V does not satisfy 


3 






a CD(R,n) curvature dimension condition, but only CD(R, oo). In particular the general dimensional 
log Sobolev or Talagrand inequalities, obtained on manifolds (see [4]) or on abstract measure spaces (as 
in [21]) do not hold. In Section 2.1 we show how the dimensional corrective term in our new Talagrand 
inequality enables to get sharp concentration inequalities. 

Inspired by recent results on the equivalence between contraction and CD(R , n ) condition in abstract 
measure spaces (see [1, 21, 14]), in Section 3 we consider applications to refined dimensional contraction 
properties under CD(R, oo) (see Proposition 3.3 and Corollary 3.8); we shall see how the dimension 
improves the asymptotic behaviour for the laws of solutions to stochastic differential equations (in the 
spirit of [12, 13]). Again the generator L = A — W ■ V does not satisfy a CD(R,n) condition, but 
only CD(R , oo). The key point here is to take advantage of the contribution of the diffusion term, which 
includes a dimensional term. We shall also see how the dimension influences the short time smoothing 
effect, through very simple arguments (see Proposition 3.1). 

In section 4 we prove two kinds of dimensional Brascamp-Lieb inequalities, a first one by a L 2 argument, 
a second one by a linearization argument in the Borell-Brascamp-Lieb inequality. For instance, under the 
sole assumption Hess(V) > 0, Theorem 4.3 states that 


Var M (/) < 


V/ • Hess(F )” 1 V/ dy 


(/-V/- Hess(F )- 1 VF) 2 
n + W • Hess(F )” 1 W 


( 11 ) 


for any smooth function / such that f fdfj, = 0. We shall discuss the optimality of our bounds and compare 
them with other very recent dimensional refinements of the Brascamp-Lieb inequality. 

In the Gaussian case where // = 7 , then the logarithmic Sobolev (9) (by optimising over s) and Tala¬ 
grand (10) inequalities are exactly (5) and ( 6 ) respectively, while the Poincare inequality (11) improves 
upon (7). 


Notation: whenever there is no ambiguity we shall respectively use H,I,W2,5lsi and Sto,i for 
I(zz|/z), W 2 (i', At),dLs/(i/|/z) and 5 Ta i{f\lj)- We shall sometimes let Ent dx (f) = f /log fdx and y{f) = 
f fdfjL and use the same notation for an absolutely continuous measure with respect to Lebesgue measure, 
and its density. 

1 Logarithmic Sobolev inequalities 

The Prekopa-Leindler inequality is a reverse form of the Holder inequality. Let F, G, H be non-negative 
measurable functions on R" satisfying f Fdx = f Gdx = 1, and let s, t > 0 be fixed such that t + s = 1. 
Under the hypothesis 

H(tx + sy) > F(x) t G(y) s (12) 

for any x, y £ R ra , the Prekopa-Leindler inequality ensures that f Hdx > 1, see [39, Chap. 19] for instance. 
The Borell-Brascamp-Lieb inequality is a stronger and dimensional form of the Prekopa-Leindler in¬ 
equality. Assume again J Fdx = f Gdx = 1 and in addition that F, G and H are positive; then the 
Borell-Brascamp-Lieb inequality asserts that f Hdx > 1 as soon as 

H(tx + sy) > (tF{x)~ 1/n + sG(y)- 1/n ) (13) 

for any x,y £ R", instead of the stronger (12) (by convexity); see again [39]. 

The Prekopa-Leindler inequality in particular implies many geometrical and functional inequalities as 
logarithmic Sobolev and Brascamp-Lieb inequalities, as observed by S. Bobkov and M. Ledoux in [9, 11] 
(see also [28] for an application to the modified logarithmic Sobolev inequality). In the coming sections we 
shall see how the Borell-Brascamp-Lieb inequality implies dimensional form of these inequalities. Following 
S. Bobkov and M. Ledoux [9, 11] our proofs are based on Taylor expansions when s —> 0 or F —> 0. 
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1.1 A general convexity inequality via the Borell-Brascamp-Lieb inequality 

Let us first state a general consequence of the Borell-Brascamp-Lieb inequality. It will lead to various 
dimensional logarithmic Sobolev inequalities. 

In the sequel we let ip* be the Legendre transform of a function ip on R n , defined for y £ R n by 


ip*{y ) = sup {y • x — ip{x)} € (— 00 , + 00 ]. 

zeiR" 


If ip is C 1 and strictly convex satisfying 


r V’(z) 
inn ——— = + 00 , 


| rzr | —>-+oo \X\ 

then (see [38, Sect. 2.1.3 and 2.4.3] for instance) for all x £ R", ip*(x) £ R and 
ip{x) = Vip(x) • x — ip*(Vi/>(x)) and S7ip*(Vip(x)) = x. 


(14) 


Theorem 1.1 (Convexity inequality) Let g,W be C 1 and positive functions on R” satsifying the nor¬ 
malization condition f g~ n dx = f W~ n dx = 1. Assume moreover that there exists a constant C > 0 such 
that for all x £ R", 




(HI) 


fj(\ x \ 2 + 1) < g(x) < C{\x\* + 1) and |Vg(x)| < C(|x| + 1). 


(H2) 


Then 


f W*{Vg) 

J g n+1 


dx > 0. 


(15) 


IfW is a C 1 positive and strictly convex function which satisfies (HI) and fW n dx = 1, then (15) is an 
equality forg = W. 


The same statement can be proved for a larger class of functions g and W. We only state this result with 
these restrictive hypotheses for simplicity reasons, as this setting will be sufficient for our main application. 
The rigorous proof is postponed to the Appendix A. The idea is to perform a Taylor expansion of the 
Borell-Brascamp-Lieb inequality (13) when s = 1 — t goes to 0. Indeed, let F = g~ n and G = W~ n 
in (13), hence satisfying f Fdx = J Gdx = 1. Then the function H t defined by 

H t (z)- 1/n = mf n (z + ih) + sW (2 - h)} (16) 

for z £ R" satisfies f H t dx > 1. The first-order Taylor expansion of H t , when s = 1 — t goes to 0, gives 

H t {z) = g{z)~ n -snstz) - ' 1-1 ^ • Vg{z)-g(z)) +^ ^ 1 ^ +o(s). 

Since 

J g~ n ~ 1 (z-\7g-g)dx = Q 

by integration by parts, the Taylor expansion of / H t dx > 1 implies the inequality (15). 

Applications of Theorem 1.1 are described in the coming two sections. They are based on the following 
observation. Let V be a given function and let IT = e"\ Then, from the convexity of the exponential 
function, for any a £ R and y £ R", 

W*{y) < —e a V*(jie~ a y) + (a - l)e a . 
n 

Combined with Theorem 1.1, this gives the following corollary which is the main tool in our applications: 


Corollary 1.2 Under the hypotheses of Theorem 1.1, let V = nloglT. Then for any function a, 

J ~^r[(e a V*(ne- a X7g) + n(a-l)e a )dx>Q. (17) 
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1.2 Euclidean logarithmic Sobolev inequalities 

As a warm up, let us first see how to quickly recover the classical Euclidean logarithmic Sobolev inequality, 
using (17). Let C : R n —> R + be a strictly convex function such that f e~ c dx < +oo, and let us 
apply (17) with V = C + /? and W = e v ^ n -, here j3 = log / e~ c dx so that J e~ v dx = 1. Since V is 
convex and f e~ v dx < +oo, it is classical that V grows at least linearly at infinity, so that W satisfies 
hypothesis (HI) . 

Then let p > 1. Let also / be a C 1 positive function such that f f p dx = 1 and g = f~ p ! n satisfies (H2), 
and let a = — ^ log / + u where u is a real constant. Then V* = C* — 0 and (17) can be written as 


Vu € R, 


J f p log(/ p ) dx < n{u — 1 ) — /? + 


C* 



f p dx. 


(18) 


We can optimise over u in R in the following case. Suppose that there exists q > 1 such that C is 
q-liomogeneous, that is, C(Xx) = A q C(x) for any A > 0 and x in R”. Then C* is p-homogeneous with 
1/p+l/q = 1, and in particular above C*(—pe~ u \/f/f) = p p e~ pu f~ p C*(— V/). Thus inequality (18) gives 


J f p log (f p ) dx < n(u - 1 ) - p + e~ pu p p j C*(—Vf) 


dx 


(19) 


for any function / such that f f p dx = 1 and / n ^ p satisfies (H2). Now, let / be a C 1 non negative 
and compactly supported function and for e > 0 let f s (x) = C E (e(\x\ 2 + l)~ n / p + /), where C e is 
such that f(f e ) p dx = 1. The function f E n ^ p satisfies (H2) for any e. Taking the limit when e goes 
to 0, inequality (19) then holds for any C 1 non negative and compactly supported function / such that 
J f p dx = 1. 

For the optimal u = p -1 log (p p+1 J C*(—X7f)dx/n), the bound (19) leads to 


[ f p log(/ p ) dx < - log 
J V 


( P p+1 J C*(—Vf)dx \ 
\^ne p_1 (f e~ c dx) p / n ) 


for any C 1 non negative and compactly supported function / such that J f p dx = 1 . Of course, the 
inequality can be extended to a larger class of functions /. Hence, we recover the optimal L p -Euclidean 
log Sobolev inequality proved in [19, 27] and in particular, setting G{x) = |cc| 2 /2 and p = q = 2, the 
classical inequality 

J f log(/ 2 ) dx < | log ^^ J |V/| 2 da;^ . 


1.3 Dimensional logarithmic Sobolev inequalities 

In this section we consider a probability measure p with density e _v and the function W = e V/,n . and a 
positive function / such that f f 2 dp = 1. We assume again that V is convex ; then W = e v ^ n satisfies 
hypothesis (HI) since f e~ l dx = 1. 

Corollary 1.2 applied with g = e W« f~ 2 / n (assuming that g satisfies hypothesis (H2)) and a = ^ — 
^ log / + u with u € R gives 

J (v* (e'“W - 2 e ~ U ^f) + V - log(/ 2 ) + n{u - 1)^ f 2 e~ v dx > 0. 


Corollary 1.3 Let dp(x) = e~ v ^ x ’dx be a probability measure with V a convex function and let f be a 
C 1 positive function such that f f 2 dp = 1 and such that g = e v ^ n f~ 2 ^ n satisfies hypothesis (H2). Then 
for any s > 0 


EntE*( 


sW - 2s^) 


V 


f 2 dp — n(l + logs). 


( 20 ) 


For s = 1, inequality (20) simplifies as 


Ent, 


,(/ 2 )< J v*{ 


W- 2 


f > 


+ V — n 


f dp, 


f dp = 1 . 
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In particular, for V = Bg—I- ^log(27r), then p is the standard Gaussian measure 7 and we recover the 
Gaussian logarithmic Sobolev inequality of L. Gross, 


Ent 7 (/ 2 ) < 2 J |V/| 2 g? 7 , 


fd'f=l. 


More generally, let V be a strictly convex function on R”. Then inequality (20) with s = 1, by (14) and 
integration by parts, leads to the modified logarithmic Sobolev inequality 


Ent M (/ 2 ) < J V*( 


W-2 


f > 


2x 


V/ 


- v*(w) 


f dp, 


f dp = l 


proved by the second author in [28]. 


Assuming uniform convexity on V we now optimise over the parameter s > 0 in Corollary 1.3, to obtain 
dimensional logarithmic Sobolev inequalities. Suppose that V is C 2 with Hess(H) > i?Id„ for R > 0. Then, 
for their inverse matrices, Hess(H*) < l? _ 1 Id n on R n . Hence, for any z and by the Taylor expansion at 
point \7V(x), 


V*{z) + V{x) < v*(yv(x)) + Vr(VF(s)) • (z - VV(x)) + — |z - VV(x) I 2 + V(x) 

2 R 


= X ' Z+ 2R^ Z ~ VF ( a; )| 2 - 


Here 


we use the relations (14). For z = sVH — 2 s-^- at point x, and by (20), this leads to 


Ent M (/ 2 ) < —n(l + logs) + s J x ■ (VH - .fd^ + ^ J 


(sW - 2 - VH 


/ dn- 


By integration by parts and extending to compactly supported functions, as for (19), we finally obtain: 


Corollary 1.4 (Dimensional LSI under ^-condition) Let /i be a probability measure with density 
e~ v where V is C 2 with Hess(H) > i?.Id n for R > 0. Then 


Ent M (/ 2 ) < n(s 


1 - log s ) + 


1 

2 R 


(1 - s)Vb + 2s 


V/ 


fdp 


( 21 ) 


for any s > 0 and any C 1 , non-negative and compactly supported function f such that J f 2 dp = 1 . 

The bound can of course be extended to other classes of functions /. 

When s = 1, we recover the classical logarithmic Sobolev inequality (1) under the Bakry-Emery condition. 
Let us observe that the right-hand side in (21) can be expanded as — n log s plus a second order polynomial 
in s. Hence it admits a unique minimiser s > 0, which solves a second order polynomial. The obtained ex¬ 
pression is not appealing and we prefer to omit it. In the Gaussian case where p = 7 , then the optimisation 
over s gets even simpler and leads again to the dimensional Gaussian log Sobolev inequality (5). 
Moreover, for a general V and as in (33) or (23) below for the Talagrand inequality, the bound (21) can 
be written as a (not either appealing) deficit in the log Sobolev inequality. 

We will see in Section 3.1 that (21) leads to new and sharp short time smoothing on the entropy of 
solutions to an associated Fokker-Planck equation. 


2 Talagrand inequalities 

The main result of this section is 


Theorem 2.1 (Dimensional Talagrand inequality) Let p be a probability measure in P 2 (M") with 
density e~ v where V is a C 2 function satisfying Hess(H) > PId n with R > 0. Then for all v £ P 2 (R”) 


R 

—w£(p,v) < v(V) — p(V) + n — nexp 


-(u{V)-p(V)-H(i 
n V 


( 22 ) 
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In other words, if Hess(U) > PId n , then i/(V) — p(V) — Wf{v,p) > — n and 

$Tai(v\fi) > max ^5 n ^H(i/\p) + p(V) - v{V ))), A„ (v(V) - /i(U) - (23) 

Here 5 n and A n are the positive functions respectively defined by S n (x) = n[e~ x ' n — 1 + x/n],x G R and 
A n (x) = x — n log(l + x/n),x > —n. 

The function <5i(x) = e~ x — 1 + x is positive and convex. It is moreover decreasing on R _ and increasing 
on R + . By a direct computation, 5i(a;) is bounded from below by x 2 /2 if x < 0, x 2 /e if 0 < x < 1 and 
x/e if x > 1; hence always by \ min(|x|, a; 2 ). Then for any 5 n [x) > \ min(|a;|, 2-). 

Since e u > 1 + u, the bound (22) implies the classical Talagrand inequality (3) under the condition 
Hess(H) > I?Id„. When /r is the standard Gaussian measure 7 on R", then R = 1 and we recover the 
dimensional Talagrand inequality ( 6 ). 

Under a moment condition Theorem 2.1 simplifies as follows: 


Corollary 2.2 Following the same assumptions as in Theorem 2.1, for all v in P 2 (R ra ) with v(V) < p.(V), 

t*Tal{v\lj) > 8 n {H{v\n)) > ^ min ^H{v\p), ^ . (24) 

Theorem 2.1 will be deduced from the following dimensional ifWJ-type inequality, applied with / = 1 
and v = gp. The HWI inequality bounds from above the entropy by the Wasserstein distance and the 
Fisher information (defined in (2)), in the form 

H{v\p) < W 2 ( v,p) y/l{v\p) - ^W 2 {p,u) (25) 

for all v. It has been introduced in [36] and proved in [36] and [16] under the Bakry-Emery condition 
Hess(U) > Rld n ,R € R. 

Theorem 2.3 (Dimensional HWI inequality) Let p be a probability measure onl™ with density e~ v 
where V is a C 2 function satisfying Hess(U) > PId„ with R £ R. Let also f,g be smooth functions such 
that fp and gp belong to P 2 (R")- Then 


nexp 


i (H(fp\p) - H(gp\p) + p{gV ) - p(fV)') 


— Tl 


< MgV) - T(fV) + W 2 (fp,gp)y/I(fp\p) - jW%(fp,gp). 


For g — 1 and v = //i, this bound can be written as the dimensional HWI inequality 


nexp 


[h(u\p) + p(V)-u(V)) 


H, 


~n< p(V) - v{V) + W 2 (p , v)y/l(v\p) - — W 2 (p, v). (26) 


As in (23) for the Talagrand inequality, this can equivalently be written as a deficit in the HWI inequality. 
It is classical that the HWI inequality (25) implies the logarithmic Sobolev inequality (1) (see [36] for 
instance). Likewise, from (26), one can obtain a dimension dependent logarithmic Sobolev inequality. We 
refer to Section 2.5 for further details. 

The proof of Theorem 2.3 will be given in Section 2.4. 


2.1 An application to concentration 

Let us quickly revisit K. Marton’s argument for concentration via Talagrand’s inequality (as in [39, 
Chap. 22] for instance) and see how the refined inequality (22) in Theorem 2.1 gives sharpened information 
for large deviations. 

Let dp = e - ' dx satisfy inequality (22). Let also A C R n , r > 0 and A r = { 2 ; Vy G A, \y — x\ > r}. Let 
finally pa = jpA)! 1 an d TA r = j p be the restrictions of p to A and A r . Then, as W 2 is a distance, 

r < W 2 {pA,HA r ) < W 2 (p A ,p) + W 2 (p Ar ,p )• 
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First of all 


W 2 {pa,p) < y/2R~ 1 H(p A \p) = v 7 2R~ 1 log(l /n(A)) := ca 

by (22), or its weaker form (3). Let now cy = f Vdp,x r = H(pA r \p) = log(l /fi(A r )) and V r = f VdpA r - 
By (22) again we get, for r > ca, 


Since x r 


7 2 / 

{r — ca) 2 < W%{p,A r ,v) < —\V r - cy + n - nexp 


log(l//x(A r )) we obtain : 


- (x r + Cy 
n 



Corollary 2.4 (Concentration inequality) Following the same assumptions as in Theorem 2.1, let 
A c R n , r > 0 and A r = {cc; \/y € A, \y — x\ > r}, c A = i/2i? _1 log(l//x(A)), cy = f V dp, V r = f Vdp Ar - 
Then for r > ca 


p(A r ) < e 


C V -Vr 


1 + n ( Vr ~ Cy _ CA 


Since (1 + u/n) n < e u , the bound in Corollary 2.4 implies the classical Gaussian concentration 


h(A r ) < e 


-§(r-c A ) 2 


r > c A 


of the Talagrand inequality (3), see again [39, Chap. 22] for instance. 

The bound in Corollary 2.4 captures the behaviour of concentration of the measure jjl in a more accurate 
way: let for instance V(x) = |a:| 2 /2 + |a:| p + Z p with p > 2 and a normalizing factor Z p , and A be the 
Euclidean unit ball in R". Then Hess(E) > Id n , so by Corollary 2.4 with R = 1 there exists a constant 
C = C(p, n) such that for all r > C 


n(\x\ > r + 1) = p(A r ) < exp cy — V r + n log(l + V r /n ) 

But V r > r p +Z p , so for all e < 1 there exists another constant C depending also on e such that for all r > C 


p(\x\ >r)< e- (1 - £)rP . 


This concentration inequality in this precise example can also be obtained by using a L p -Talagrand in¬ 
equality or a L p -log Sobolev inequality; however we have found it interesting to get it by means of the 
dimension dependence of the classical Talagrand inequality, moreover in a shorter and more straightfor¬ 
ward manner. 


2.2 Tensorisation and comparison with earlier results 

In R n , let W\ be the Wasserstein distance between probability measures, for the cost |y — x\,x,y € M™. 
Deficit in the Gaussian Talagrand inequality (for fi = 7 ) and for centered measures v has been investigated 
in one dimension in [ 6 ] and [ 8 ], in the form 

STali^h) > cinf / A(|y - x\)dn(x,y) > cmin {VFi(;/, q) 2 , ITi(i/, 7 )}. 
n Jlxl 


Here the c’s are diverse numerical constants and the infimum runs over couplings tt of 7 and u. 
This second lower bound has been extended in [22, Th. 5] to any dimension n , as 


STaii^h) > cmin 


V n 


tFi,i(^7)\ 

Vn ) 


(27) 


as soon as v has mean 0; here c is a numerical constant independent of the dimension n, and on R" x R n 


v) = inf 

7T 



Xi\ d-K{x,y). 


Still under a centering condition, the bound (27) has been improved in [17, Prop. 3] by replacing the 
quantity W\ t i/y/n by the larger W\ Wasserstein distance on R”, and extended to reference measures p, 
with density e -V where Hess(P) > i?.Id ra . 
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In comparison, our bound (23) has the following two advantages : it holds without any centering condition 
on v, and gives a lower bound on the deficit in terms of the relative entropy H : this is a strong way of 
measuring the gap between measures, by the Pinsker inequality for instance (see [39, Chap. 22]), and the 
relative entropy can be much larger than the weak distance W 2 . 

As considered in [17] and [22], a natural example is the product measure case when p N = ® N p and 
v N = ® N v on S, nN for N £ N*. Then 5to,i{v n \p N ) = NSraiiy\p) by tensorisation properties of both H 
and Wf ■ However, the above bound (27) in [17] (so with W\ instead of Wip/y/n) gives a lower bound on 
&Tai(v N \/x N ) equal to a constant c times 

min{Wi(u N ,p N ) 2 ,Wi(u N ,p N )) < min (W 2 (v N , p N ) 2 ,W 2 (v N , p N )) = min ^NW 2 (ix, p) 2 ,VNW 2 (v, p)^ 

since W\ < 11-2. Hence this lower bound has the good order in N at most only for small perturbations v 
of the reference measure p. 

In contrast, our bound always has the correct order in N. Indeed, if = ® N V so that dp N = e~ v <W) dx 
on M . nN , then 

H(y N \p N ) + /(V (J,) ) - ^(H (JV) ) = N {H(y\p) + p(V) - v(V )); 
hence Theorem 2.1 leads to 

S T ai(v N \p N ) >NS n (H(u\p) + p(V) - v{V )), 
which has the correct order in N. 

2.3 Useful facts on optimal transport 

In the proof of Theorem 2.3 and in proofs below we shall need the following notation and facts. 

If /i is a probability measure on R” and T : R n —> R” a Borel function, we let T be the image measure 
of /j, by T, defined by T#/x(/i) = n(h o T) for all bounded continuous functions h : R” —> R. 

Let now fx o and /ii in P 2 (R”) be absolutely continuous with respect to Lebesgue measure. Then there 
exists a convex function ip on R" such that /n = Vtp#/x o (see [38, Th. 2.12] or [39, Th. 10.41] for instance). 
The map S/ip is called the Brenier map. Moreover 

J |V<p(:r) - x\ 2 dp 0 (x) = W2(fx 0 , /n). 

Now, by the Alexandrov Theorem (see [34] or [39, Th. 14.1] for instance), a convex function ip is almost 
everywhere twice differentiable: for almost every x £ R” there exists a non negative symmetric matrix A 
such that 

ip(x + h) = ip{x) + Vip(x) ■ h + ^ + °(N 2 ) 

as h tends to 0 in R". The matrix A is denoted Hess(^!>)(a:) and called the Hessian of ip i n the sense 
of Alexandrov. The trace of A will be denoted A ip(x) : it coincides with the density of the absolutely 
continuous part of the distributional Laplacian of ip, the singular part being a non negative measure. 

In fact, in the above notation and by [34, Th. 4.4] or [1, Th. 6.2.12], Hess(^)(a;) is a positive matrix for 
/io-almost every x. Moreover, by [34] (see also [1, Lem. 5.5.3]), the Brenier map solves the Monge-Ampere 
equation 

/xo(x) = ni(Vip(x)) det(Hess(i£>)(a;)) (28) 

at /ro-almost every x in R™. Here fx o and /ii are the densities of the measures. 

Let now <p* be the Legendre transform of ip. Then no = Vy>*#/n by [38, Th. 2.12] for instance. Moreover 
\7ip*(Vp(x)) = x and Vip(Vip*(i/)) = y for /io-almost every x and /ii-almost every y. 

Furthermore, by [34, Th. A.l], if Hess(<^)(a:) is invertible at x then ip* is twice differentiable at Vip(x), 
with Hess(</?*)(V</3(a:)) = [Hess(y>)(a;)] . By the remark above, this is the case for /i 0 -almost every x. 

Finally, the curve (fx s ) s e[o,i] defined by p s = ((1 — s)Id + sX7p)#p o is a geodesic path in P 2 (R”) between 
po and pi, in the sense that 

W 2 (jx s ,ih) = \t ~ s\ W 2 {fX 0 ,fXl) 

for all 0 < s,t < 1. It holds that p s is also absolutely continuous with respect to Lebesgue measure, 
see [34, Prop. 1.3] or [38, Th. 5.9] for instance. 


10 


2.4 Proof of Theorem 2.3 

Theorem 2.3 is a consequence of the relation 

H(hp\p) - p(hV) = Ent dx {he~ v ) (29) 

written with h = f,g and of the following lemma. 


Lemma 2.5 Following the same assumptions as in Theorem 2.3, let f,g be two smooth functions such 
that f p, and gg belong to P 2 (R n ). Let ip be a convex function on R" such that \7ipff(fp) = gp. Then 

Jvgdp-Jvfdp- J ‘(Vip -x)-Vfdp> nexp i^Ent dx{fe~ v ) - Ent dx(ge~ v )^ 

+ j J (V<p(x) — x) ■ Hess(T)(x + t(Vip(x) — x))(Vip(x) — *)(1 — t)dt f(x) dp{x). 


Indeed, if Hess(E) > i?Id„, then the last term above is greater than A J |Vy> — x\ 2 f dp = -f Wf(fp,gp ). 
Moreover, on the left-hand side, 

— j (Vip — x ) • V/ dp < j | V ip — x\ 2 f dp 


"1 1/2 r f |V/| 2 1 1/2 


/ 


dp = W 2 Up, gp) UUfT\ t) 


by the Cauchy-Schwarz inequality. This implies Theorem 2.3. 

Proof of Lemma 2.5. 

By the Taylor formula, 

V{f\7ip{x)) — V(x) = VV(x) • (Vi p(x) — x)+ j (V(p(x) — x) -}less(V)(x + tU7ip(x) — x))(Vip(x) — x)(l — t)dt 

J o 

for almost every x in R". We now integrate with respect to f p and use the comparison between Alexandrov 
and distributional Laplacians to deduce that 

J Vf(i)' (Vtp(x)—x) f{x) dp(x) > J [(A tp—n)f+(\7ip—x)-\7f]dp = J A (pfdp-n+J(Vip—x)-'Vfdp, 

as in [16] or [38, Th. 9.17] for instance. This leads to 

J V g dp — J V f dp — J (Vy> — x) ■ V / dp> J A <p f dp — n 

+ JJ U7ip(x) — x) ■ Hess(V)(x+ t(Vip(x) — x))U7ip(x) — x)(l — t)dt f(x) dp(x). (30) 
Then Lemma 2.5 is a consequence of the following Lemma. 

Lemma 2.6 Let po,pi € P 2 (R") absolutely continuous with respect to Lebesgue measure, with respective 
densities also denoted po and p\. Let <p be a convex function on R" such that 'Vipjfpo = p\. Then 

(31) 


(32) 


f 

Ent dx(po) - Entdx(pi) 

j zAyz u-yLiy x IL LXp 

n 


Proof 

< Taking logarithms in the Monge-Ampere equation (28) and integrating with respect to po lead to 

Entd x (/x 0 ) = Ent dx (pi) + J log det(Hess (ip)) dp 0 . 

Now, if for each x the symmetric matrix Hess(yj) has eigenvalues <pi, then by the Jensen inequality 
j log det(Hess(<^)) dp 0 =n ^ ^ J log (tpi) dp 0 < nlog ( J dp 0 ^j =nlog JAipdpo^j. 


This concludes the proof. > 
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Remark 2.7 In the Gaussian case, we have already observed that translations of the Gaussian measure 
are extremals of the Talagrand inequality. As observed in [16], or as can be observed from the proof above, 
there are no other extremals. Indeed the Hessian of the map ip has to be constant and equal to the identity 
matrix for all inequalities to be equalities. 

In fact, if Hess(U) > Rld n , then equality in the Talagrand inequality implies that the potential is neces¬ 
sarily Gaussian and that extremals are translations of the Gaussian measure. 

2.5 Logarithmic Sobolev inequalities by transport 

As observed in [36], the HWI inequality (25) classically implies the logarithmic Sobolev inequality (1) 
by bounding from above the second order polynomial in W 2 in HWI by its maximum. Likewise, the 
dimensional HWI inequality (26) is another path towards dimensional logarithmic Sobolev inequalities. 
Here we obtain : 

Let p, have density e~ v where V is C 2 and satisfies Hess(U) > Rid™ with R > 0. Then 

H(v l/r) < v{V) - »(V) + n log (l + ± + p(V) - v(V)) ) 

for all v. Equivalently, in terms of deficit, 

&lsi{v\p) > R max (v{V) - p{V) - g(i/|/r)), A n ( 7 ^^ - v(V) + /i(V))j|. (33) 

In the Gaussian case, then R = 1 and we obtain a bound which is slightly worse than (5), where a 
log(l + 2 u) term is replaced by the larger 2 log(l + u). 

At this point, let us observe that still in the Gaussian case a dimensional HWI has been derived in [8, 
Th. 1.1], It is also observed by the authors that the HWI inequality in [8] does not seem to imply (5). We 
could not compare the HWI in [8] to our bound (26) in full generality. However, if ^(|a:| 2 ) = n = 7 (|x| 2 ) 
then they can respectively be written as 

2h < x — y + log(l + x) and h < log(l + x — y/2) 

for x = W‘i\fl [n, y = Wf/n and h = H/n; hence our bound is at least significantly more precise in the 

common range I W 2 ~ 1: indeed then x y ~ 1 in this range, so that comparing the two right-hand 

sides amounts to x log(l + x). 

As remarked in [8, 22] it is also possible to get refined logarithmic Sobolev inequalities by combining the 
HWI and Talagrand inequalities. Here, if Hess(U) > Rld n with R > 0, then (26) can be written as 

H + 6 n {-h) < W 2 Vl - |Wf (34) 


where h = H + p(V) — v{V). Moreover H = ^Wf + Srai , so 

6Tal < VJ-RW 2 . 

W 2 


Then, by (34) again and Theorem 2.1, 

Sf.si = \l-RH> RS n (—h) + i (Vi- RW 2 ) 2 


In particular this improves upon the first lower bound in (33). Let us recall that the function S n is defined 
above, after Theorem 2.1. 

Refined Gaussian logarithmic Sobolev inequalities have been considered for certain classes of test mea¬ 
sures v : measures v satisfying lower and upper curvature bounds as in [8] and [32] , measures v satisfying 
a (weaker) Poincare inequality as in [22]. Under these additional assumptions on v , the goal is then to 


> R6 n ( — h) + 


> RS n ( — h) + 


1 (<$Tal + 5 n { — h)) 2 

2 Wf 

1 {S n (h) + *$„,(-fe)) 2 

2 Wf 
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obtain better constants in the logarithmic Sobolev inequality, mimicking in a sense the phenomenon ob¬ 
served in the Poincare inequality when considering test functions orthogonal to the first eigenfunctions. In 
Indrei-Marcon [32] , the deficit is controlled by the Wasserstein distance for the class of centered functions 
with upper and lower bounded curvature. The authors in [ 8 ] also give new bounds in terms of condition¬ 
ally centered vectors. Further improvements are given in [22] in terms of the Wi t i distance defined in 
Section 2.2. Here again our bounds share the advantages of holding without any smoothness, centering, 
etc. hypothesis on v, and of having the good dimensional behaviour when considering product measures. 


3 Applications to Fokker-Planck equations 

Let us now see how our results (or methods) lead to short-time smoothing of the entropy and improved 
contraction rates for the laws of solutions to stochastic differential equations. 

For this, let again b be a C 2 function on R" such that J e -V = 1 and Hess(H) > i?Id n , with R possibly 
negative, and satisfying the doubling condition V(x + y) < C( 1 + V(x) + V{y)) for a C and all x,y. Let 
also y be the probability measure with density e -V . We let uq in P 2 (R") and consider gradient flow 
solutions u = {u t )t >o £ C([0,+oo), P 2 (R")) of the Fokker-Planck equation 

r\ 

= Au t + V • (utVV), t> 0, a; € r (35) 

at 

as in [1, Chap. 11.2.1] and [18, Th. 4.20 and 4.21] (see also [33]). Equation (35) holds in the sense of 
distributions. Moreover, by [1, Th. 11.2.8] or again [18], for any t > 0 the solution u t has a density; 
for almost every t > 0 this density is in (R™), with Vu t /u t + VV £ L 2 {u t )\ finally t >->• I(u t \y) £ 
L Lc (]°, +oo[) and 

^H(u t \n) = -I(ut \y) 

for almost every t > 0. The solution lit. can be seen as the law at time t of the solution (X t )t>o to the 
stochastic differential equation 

dX t = V2dB t - VV{X t ) dt. 

Here (B t )t >o is a standard Brownian motion on K™ and the initial datum Xq has law uq. 

Moreover, the interpretation of (35) as the gradient flow of H(- |/z) on the space P 2 (R n ) has enabled to 
obtain the following short-time and contraction properties (see [1, Th. 11.2.1] and [39, Chap. 24]). Let u 
and v be solutions to (35). Then 

H[Ut\H) < M^l^l e 2max{-fl,0} t) t > Q ( 36 ) 

and 

W 2 (u t ,v t ) < e~ m W 2 (uo,vo), t> 0 . (37) 

In particular, if R > 0, then ut converges to the steady state y as 

W 2 (u t ,iJ,)<e- Rt W2(uo, f i), t> 0. (38) 

The purpose of this section is to improve these three properties by means of the tools and inequalities in 
the above sections. 

3.1 Short-time smoothing of the entropy 

In the Gaussian case where y is the standard Gaussian measure 7 , the solution to (35) is given by the 
Meliler formula (see [4, Sect. 2.7.1]). In particular the fundamental solution, with initial datum uq the 
Dirac mass at 0, is at time t > 0 the Gaussian measure with variance of = 1 — e _2t : 

u t {x) = ( 27 R 7 t 2 )- n / 2 e- x2 /( 2ff *\ 2 £ IT. 

Its relative entropy can be computed as 

H{u t \l)= f u t (x) log ^j-^-dx = ~ w [e -2t + log(l — e _2t )]. 

7W 2 
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Of course this is coherent with (36), with R = 1, since 


by direct computation. In fact, for t ~ 0 one can observe that 

H{u t \i) ~ \ lo g^- 

On the other hand, let u be a solution to (35), still in the Gaussian case, and with initial datum uq such 
that mo(M 2 ) = n = 7 (|:r| 2 ). Then u t (|:c| 2 ) = n for all t since 


d 

dt 


|x| 2 dut = 2n — 2 


\x\ 2 du t . 


(39) 


In particular, in the notation H(t) = H(ut\”f)/n and I{t ) = I(ut\"/)/n, the dimensional Gaussian loga¬ 
rithmic Sobolev inequality (5) simplifies as 2H < log(l + 1). Hence 

H'(t) = —I(t ) < 1 — e 2H ^\ for a.e. t > 0. 

By the change of variable x(t) = e~ 2h ( t ' > this integrates into 

x(t)e 2t > *(0) + e 2 ‘ - 1 > e 2t - 1. 

In other words 

H(uth)<-^og(l-e~ 2t ), t> 0 
which gives the same short-time behaviour. 


More generally : 


Proposition 3.1 Let u be a solution to (35) with Hess(H) > Rld n ,R > 0, and with initial condition u,$ 
in P 2 (R n )- Let T > 0 and assume that ut( |VH| 2 ) < M for t in [0,T]. Then there exists a constant c > 0 
depending only on n, R and M such that 

H(u t \iT) < max|l,-log-|, t < T. 

Remark 3.2 The moment assumption Ut{ |VH| 2 ) < M for t in [0,T], is not a restrictive condition. It 
can indeed be checked by time differentiating zij(|VH| 2 ) and controlling its non explosion via a Lyapunov 
type condition on uoe v or on derivatives ofV for instance. 

It can also be checked by observing that the Markov semigroup {Pt)t >o with generator L = A — Vb • V is 
such that f (f>dut = f Ptfiduo for any test function <f>. In particular, if Q is a convex function and if the 
initial datum has a density also denoted uq, then 


u t {\S7V\ 2 ) = j |W| 2 du t = J P t {\\JV\ 2 )du 0 = J P t (\VV\ 2 )u 0 e v dp 

< j <S>(P t (\VV\ 2 ))dp + j $*(u 0 e v )dp < J $(|W| 2 )d/z + J <f>*(u 0 e v )dp. 


Here we use the fact that t i —> J $(P 4 (| W| 2 ))dp, is non increasing since $ is convex. The moment 
assumption is then satisfied for all T > 0 as soon as the right hand side is finite for a convex function $. 

Proof 

<3 We shall let c denote diverse positive constants depending only on n, M and R. By Corollary 1.4 
applied to the measure f 2 p = Ut, and integration by parts, there holds 

H{u t \p) < n{s - 1 - logs) + |W| 2 ) + SIkS r 1] u t (AV) + l^I{ Ut \p) 
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for t > 0 and s > 0. Recall that / has been introduced in (2). Since V is convex, then AV > 0 and then 

s 2 

H{u t \n) < -n logs + c+ — I(u t \[i) 


for all s e]0,1] and t S]0,T]. 

Now, as far as H(t) := H{ut\n) > 1, then /(f) := I[ut |/r) > 2 R so that s = ^/2R/I is smaller than 1. For 
this s we obtain 

TL 

H < c + - log/. 

Hence 

H'(t) = -I{t) < - e 2ff(t) / n - c 

for almost every t > 0. As above x(t ) = e~ 2H ^ n satisfies x(t) > x(0) +ct > ct by time integration. Written 
in terms of H, this concludes the proof. > 


3.2 Refined contraction properties 

Let us now see how to make (37) finer. Still by [1, Th. 8. 3. 1] and [18, Th. 4.20 and 4.21], one can write 
(35) as the continuity equation 

f) 

-kt + V • (£M«t) = 0, f>0,iel" 


with £[ut] = — VV — V log ut- Then for almost every t > 0 
\ d f 

~2dt W ^ Ut,Vt ^ = j ‘ ( v< Mz) ~x)ut{x)dx (40) 

Aipt(x) + Atp*(Vtpt(x)) — 2n + (W(V</?t(a;)) — VF(&)) • (Vy>t( x) — x) ut(x) dx (41) 


> 


for two solutions u and v. Here ipt is the convex map such that Vt = S7ipt#ut and ut = for 

the Legendre transform of ip t (see Section 2.3). Equality (40) follows from [39, Th. 23.9] (see also [1, 
Th. 8.4.7]); its assumptions are satisfied since (and likewise for v) 



pt2 

|£[u s ] \ 2 du s ds = / I(u s \n)ds = H{u tl \n) - H(u t2 \n) < H(u tl \n) 

Jtx 


which is finite for any ti> t\> 0, as observed above. Inequality (41) follows from a weak integration by 
parts, as in [33, Th. 1.5]; there again A ip t is the trace of the Alexandrov Hessian of tpt. 

Now, for given t > 0 and iq-almost every x, the symmetric matrix Hess(<pt)(a;) is positive, as recalled in Sec¬ 
tion 2.3: letting e 2Ai ( x ) for i = 1,..., n its n positive eigenvalues , then its inverse matrix Hess(<^J')(Vv?t(x)) 
(see again Section 2.3) has eigenvalues e ~ 2X d x )^ hence at point x 


A(fi t + Aip*(V(pt) ~ 2 n = tr[Hess(</? t )] +tr[Hess(</?j )(V(^ t )] —2 n = ^ (e 2Ai +e 2Ai — 2) = 4 ^ sinh 2 (A,;)■ 

i i 

(42) 

Hence, by convexity of sinh 2 and the Jensen inequality, and (32), 


Aiptix) + Aipt (Vipt(x)) — 2n uAx) dx = An — / sinh 2 (A i(x))uAx)dx 

J n^J 

> An sinh 2 [ — [ Xi(x)ut(x)dx 

\ n iJ j 

= 4rasinh 2 j log det Hess(<^t)(a;) u t (x) dx 

f EntdzC^t) - Ent dx (it t ) 


= An sinh' 


2n 
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Since Hess(P) > i?Id n , we obtain 


2 . 2 fEnt dx (v t ) -Ent dx (w t )\ 2 

- 2~dt W ‘ 2 ' Ut ' Vt ) ~ 4nsmh l-2n- )+i?W 2 ('Ut,^t). (43) 

By time integration this ensures the following dimensional contraction property : 

Proposition 3.3 In the above notation, ti/Hess(P) > i? Id n for R £ R, then for any solutions to (35) 

W 2 (u t ,v t ) < e~ 2Rt W 2 (u 0 ,v 0 ) - 8nJ* e ~ 2R ^ sinh 2 | En W(p.)~ E ntdx(«,) j d . 8> t > 0 . (44) 

For the heat equation, namely for V = 0, then the associated Markov generator L = A satisfies the 
CD( 0, n) curvature-dimension condition: in particular in this case the bound (44) has been derived in [13] 
and [14], and is also a consequence of [21]. For V ^ 0, then the associated generator L = A — VP • V 
satisfies a CD(R, oo) but no CD(R,n) condition: in particular the bound (44) can not be obtained from 
the works mentioned above. 

Remark 3.4 The above computation can be extended to drifts A{x ) which are not gradients. In this case 
the assumption Hess(P) > i?Id„ should be replaced by the monotonicity condition (A(y) — A(x)) ■ (y — x) > 
R\y — x\ 2 for all x,y (see [12] for this non-gradient case). 


3.3 A formal gradient flow argument to Proposition 3.3 

In this subsection, we provide an alternative formal argument to Proposition 3.3 based on gradient flow. 
We begin with the following elementary lemma which gives additional information to [21, Lem. 2.2]. 


Lemma 3.5 Let ip be a C 2 function on [0,1]. Then the following properties are equivalent: 

• ip" > ip' 2 /n; 

• for all r, s in [0,1], 

ip(r) — ip(s) 

n — ip'(r)(s — r) > ne " ; 

• for all r, s in [0,1], 

(ip'(a) - ip'(r )) (a-r)> 4n sinh 2 . 


Proof 

< Let indeed U = so that 



Then ip" > ip' 2 /n if and only if U is concave, hence if and only 
e -^r- = U(s) < U(r) + U'(r)(s — r) = 


iP'(r) _ 

-e 

n 


- (s 


r) 


(45) 

(46) 


for all r,sG [0,1], which is (45) when multiplying both sides by e^^/ 71 . 

Adding (45) with the corresponding bound obtained with r,s instead of s,r leads to (46). Conversely, 
dividing (46) by (s — r) 2 and letting s go to r gives ip" > ip' 2 jn at point r. > 


Let now p° and p 1 be absolutely continuous measures in P 2 (R”), \7ip their Brenier map and (p s ) S £[o,i] 
the geodesic between them, as in Section 2.3. Here again we identify the measures with their densities. 
Let us now recall why the function ip : s i—>■ Ent dx{p s ) formally satisfies ip" > ip ' 2 /n on [0,1]. For this, 
recall from Section 2.3 that for /i°-almost every x the Alexandrov Hessian Hess(</?)(a;) is positive, so that 
the eigenvalues 6i{x) of Hess(y>)(a;) — I are > —1. Writing (32) with the measures po = p° and p\ = p a , 
we obtain 


ip{ 0) = ip{s) + J logdet(/ + s(Hess(y>)(x) - /)) dp°(x) = ip(s) + ^ j log(l + s6i(x)) dp°(a 
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Hence 


V>'(s) = -Y1 

i 

and then by the Cauchy-Schwarz inequality 


ei 


1 + sO, 


■dp? 


(47) 


ip"(s) = n — Y [ —— 2 dfj° > n [ d(i°\ = —if'(s) 2 . 

V n z -'J (1 + s8i) 2 p \n ^ J 1 + s6i j n w 

Remark 3.6 Identity (47) can also be formally checked using the continuity equation solved by (/^ s )sg[o,i] ■' 

dp 3 


ds 


+ V • (p s v s ) = 0. 


Here the vector field v s satisfies v s (x + s(Vip(x) — x)) = V<p(x) — x, see e. g. [38, Th. 5.51]. For, and 
recalling that ip(s) = f p 3 logp 3 dx 

<"V> d * = -/ V .„>■ dx = -/ (V . „*)(x + .(V VW - x))**(•) 

by integration by parts and since (x + s(Vp(x) — x))f[p 0 = p 3 . Identity (47) follows since by chain rule 
(V • v 3 )(x + s(Vv?(: r) — x)) = tr (Hess((/?)(x) — /)(/ + s(Hess(<^)(a;) — /)) 1 = ^ \^ l Q 

i 1 

Remark 3.7 In the above notation, observe that (45) in Lemma 3.5 for if(s) = Entd x (p 3 ),r = 0 and 
s = 1 formally leads to (31) in Lemma 2.6. For, in the notation of Remark 3.6 and by integration by parts, 

ijj'( 0) = J Xp°-v°dx = J X7p° • (Vip — x)dx = n — J Aipdp 0 . 


We can now deduce an alternative formal argument to the bound in Proposition 3.3. 

We begin with the following classical observation in Euclidean space : Let X and Y be two solutions of 
the Euclidean gradient flow X[ = —VP(X t ) in R d , where V : — > R is a smooth potential. For t > 0 
let Ut{s) = U(X t + s(Y t — X t )) for s £ [0,1]. Then 

- ~\Yt - Xt I 2 = (Xt - Xt) • ( VU(Y t ) - VU(Xt)) = UK 1) - um- (48) 

Let now u and v two solutions to the Fokker-Planck equation (35), which by [1, Chap. 11.2] and [39, 
Chap. 23] is the gradient flow of H(- |/z) on the space P 2 (M"). For any t > 0, let Vift be the optimal 
transport map between ut and Vt, and (Mt)se[o,il be the geodesic path in P 2 (R") between ut and vt, as in 
Section 2.3. Then, formally and by analogy with (48), 

-~W 2 (ut,v t ) = E' t (l)-E'M (49) 

where for given t > 0 we let 

E t {s) = H(p s t \p) = Ent dx (p s t ) + J V dp s t . 

Indeed, let Ent dx (Pt) f° r gi ven t and, for each x let the matrix Hess(</? t )(x) have eigenvalues e 2Xi . 

Then, in the above notation 6i = e 2A< — 1, (47) for p = Ut gives 

^(l)-^'(O) = 
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as in (42). Using moreover the formal derivative 


d_ 

ds 



d_ 

ds 


V(x + s(Vipt(x) — x))du,t(x) = J VU (x + s(Vipt{x) — x)) ■ (Vtpt{x) — x) dut{x) 


for s = 0,1 we formally recover (41) in (49). 

We now use the fact that for given t the function %p satisfies ip" > ip' 2 /n on [0,1]. Then, by (46) in Lemma 
3.5 for r = 0 and s = 1 we obtain 

E' t { 1) - E' t ( 0) > 4nsinh 2 ^ Entdx ^* \^ nt ' dl ^ U1 ^ + J (VU(V< p t (x)) - VU(x)) • (Vy> t (x) - x) du t (x). 


Since f |Vy> t (x) — x\ 2 du t {x) = W 2 (u t ,Vt) this leads to (43) and then to (44) as soon as Hess(U) > i?Id n . 


3.4 Improved convergence rates 

In this section we consider a solution u to (35) in the Gaussian case where /r = 7 , and for which we can 
take R = 1 above. Let us see how the contraction property (44) can make the convergence estimate (37) 
more precise. 

For simplicity we assume that uo(M 2 ) <n = 7 (|x| 2 ). Then u t (|x| 2 ) < n for all t, by (39). Hence (29) and 
the Talagrand inequality (22) ensure that 0 < W 2 put,^) < 2 n and 

Ent dx {u t ) - Ent d x(7) > _ 1q G _ W|K ! 7 )\ 

Ti \ 2 n ' 

In particular the right-hand side is non negative. Moreover, for the stationary solution vt = v 0 = 7 , the 
contraction property (44) with R = 1, in the form (43), implies 

/ x2 

—x > - -h 2 x 

1 — x 


where x(t) = H / |(M t ,7)/(2n) G [0,1). Here we use that sinh(logx) = (x — l/x)/2. In other words 
z(t) = 1 — (1 — x(t)) 2 satisfies z' < —2 z. This integrates into z(t) < e~ 2t z( 0), that is, 


x(t) < 1 


1 

^1 — (2x(0) — x(0) 2 )e -2 ^ 2 . 


(50) 


By the lower bound 


1 - (2x(0) - x(0) 2 )e" 2t > (1 - x(0)e" 2t ) 2 


it implies the classical bound (37). It also improves it: for instance (50) can be written as 


(51) 


W 2 2 (u t , 7 ) < W 2 2 (w 0 , 7 )e 2t - 

1 + 


2 — x(0) 

T • 

(l- (2x(0)-x(0) 2 )e- 2 *) 5 


Then by (51) we obtain 

Corollary 3.8 In the above notation, let u be a solution to (35) in the Gaussian case, with initial datum 
u 0 such that uo(|x| 2 ) < n. Then for all t > 0 


W 2 2 K, 7 ) < W 2 (u 0 ,7)e 24 


1 - IT 2 2 (M 0 ,7)/(4n) 

1 - IU 2 2 (Mo,7)e -2 7(4n)' 


Observe that the quotient is smaller than 1. 


Remark 3.9 The Gaussian assumption is used here only to ensure uniform convexity of the potential 
(hence the Talagrand inequality), and that f Vdut < / Ve~ v dx as soon as this holds at t = 0. 


18 








4 Brascamp-Lieb inequalities 

It is classical that linearizing a logarithmic Sobolev inequality leads to a Poincare inequality, which in the 
Gaussian case is the Brascamp-Lieb inequality. In this section we shall see how to obtain two different 
dimensional Brascamp-Lieb inequalities: a first one by an improvement of the classical L 2 method, and a 
second one by linearization in the Borell-Brascamp-Lieb inequality (13). 


4.1 Brascamp-Lieb inequality by L 2 method 

Proposition 4.1 (Dimensional Brascamp-Lieb inequality I) Let p be a probability measure on R n 
with density e~ v where V is a C 2 function satisfying Hess(H) > 0. Then 


Var M (/) < 


V/-Hess(V) _ 1 V/d^ 


(fVfdn-fVdpJfduy 

n — Var M (P) 


(52) 


for all C l compactly supported functions f. 


Remark 4.2 V. H. Nguyen [35] has proven that Var^(P) < n for V convex. We will observe in the proof 
that even Var^fV) < n as soon as Hess(H) > 0. In fact, it follows from the bound (52) for f — V that 
Var^{V) < < n where I = f VV ■ Hess(P ) -1 VV dp. In particular, if Rld n < Hess(H) < STd n , then 

I < Rf |VP| 2 dp = i ? _1 f A V dp < nS/R and Var M (P) < -^rg- The latter inequality is an equality (to 
n/2) for the Gaussian measure with any variance, for which R = S. 

If /i = 7 is the standard Gaussian measure then (52) is exactly the dimensional (Poincare) inequality (7) 
(and in particular equality holds for / = \x\ 2 /2). 

In the non Gaussian case, G. Harge has derived the following improvement of the Brascamp-Lieb inequality, 
see [31, Th. 1] : if V is a C 2 function satisfying t?Id„ < Hess(V r ) < S'Idn for constants 0 < R< S, then 

Var M (/) < J V/ • Hess (Vy'Vfd/jL - 1 + ^ /S 

for all /. 

We do not know in full generality which of the coefficients (n — Var^fV)) -1 and n -1 (l + R/S) in the 
corrective terms of (52) and (53) is the larger. 

Besides being equal (to 2n -1 ) in the Gaussian case, both coefficients are always larger than n -1 . More 
precisely the coefficient in (52) is always strictly larger than n -1 whereas the coefficient in (53) is n -1 
when R = 0 (no uniform convexity) or S = +00 (no upper bound on Hess(H)): hence at least in these 
cases our bound is stronger. 

The bound (53) has been obtained in [31] by a L 2 argument. We shall see in the appendix that it can be 
formally recovered by linearization in the Monge-Ampere equation. 


V f dp— / V dp 


(53) 


Proof of Proposition 4.1. Let uj be in the space C£° of C°° and compactly supported functions. Then 

2 


J HHessHH^s dp - (/ A ui dp^j >0 


by the Cauchy-Schwarz inequality; here ||Hess(w)||^ s = •_i(3iju ;) 2 is the squared Hilbert-Schmidt 

norm of the matrix Hess(w) = (dijLo)ij. In other words 

J Vu • Hess(H)Vcj dp < J (||Hess(o;)||| ?i g - 1 - Vw • Hess(H)Vw) dp — — (^j A ivdp'j . (54) 

Moreover, by integration by parts, 

J (||Hess(u;)||^ s + Vu • Hess(H)Vw) dp = J (Leo) 2 dp, J Aw dp = — J VLuidp (55) 
with L = A — VV ■ V, see [4, Section 3.2]. 
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Let now / be a C 1 compactly supported function. Then pointwise 

2 V/ •Vw<Vw Hess(V)Vu; + V/ • Hess (V) _ 1 V/. 

From these remarks, inequality (54) implies 

2 J V/ ■ Wujdp < J V/ ■ Hess(V) _ 1 V/d/x + J ( Luj) 2 dp — — (^J VLudp 

Let now h = —Lui. Then j V/Vw dp = — f fLui dp = J fh dp by integration by parts. 

To sum up, we have obtained 

2 J fhdp< J V/ • Hess(V) - 1 V/ dp + J h 2 dp — ~(^J Vhdp)j (56) 

for any h in L(C^°) and any C 1 compactly supported function /. 

But, by [31, Lem. 9] for instance, L(C£°) is dense (for the L 2 {p) norm) in the space of functions h £ L 2 (p) 
such that / hdp = 0. Hence, formula (56) extends to any h £ L 2 (p) such that / hdp = 0. 

In particular, given a C 1 compactly supported function / such that J f dp = 0 , we can apply (56) to 
h = f + a(V — f V dp) with a £ R. Observe indeed that V £ L 2 (p) for p = e - ' with V convex. We get 


f z dp < / V/ • Hess(V) L Vfdp + Ia 2 — 2a 


Var / 1 (H) 


Vfdp~- 

n 


for all a, where / = Var /i (V)(n — Var / 1 (V))/n. Necessarily / is positive, that is, Var M (P) < n. Indeed, if 
I was non positive, then the left-hand side would be —oo by letting a tend to ±oo, which is impossible. 

We finally optimise over a, choosing a = J Vf dp/(n — Var^fV)). This concludes the proof of Proposi¬ 
tion 4.1 for any / such that f f dp = 0, and then for any /. > 


4.2 Brascamp-Lieb inequality via the Borell-Brascamp-Lieb inequality 

The following result gives an improved version of the Brascamp-Lieb inequality (4) from the Borell- 
Brascamp-Lieb inequality. 


Theorem 4.3 (Dimensional Brascamp-Lieb inequality II) Let p be a probability measure on K™ 
with density e~ v where V is a C 2 function satisfying Hess(V) > 0. Then for any C 1 and compactly 
supported function f such that J fdp = 0, 


Var M (/) < 


V/ • Hess(V ) -1 V/ dp 


(/- V/• Hess(V )- 1 W ) 2 
n + VH-Hess(V )^ 1 W 


(57) 


Theorem 4.3 is proved in Appendix B. 

For the standard Gaussian measure, we obtain 

Corollary 4.4 The Gaussian measure 7 satisfies the dimensional Poincare inequality 

Var 7 (/) < J | V /| 2 d-y-J (/ ~ d 7 (58) 

for any C 1 and compactly supported function f such that f fdj = 0. 

By the Cauchy-Schwarz inequality and integration by part, 

f (/- V/~ x) 2 d ^ (f V / • xd 7 ) 2 _ (f A/riy ) 2 _ (f f\xj 2 /2dj) 2 
J n + \x\ 2 ^ ~ 2n 2 n n — Var 7 (|x| 2 / 2 ) 

Therefore, for the Gaussian measure, inequality (58) is stronger than (7) mentionned in the introduction 
(and naturally equality still holds for / = |cc| 2 /2). 
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4.3 Comparison of Brascamp-Lieb inequalities 

Many dimensional Brascamp-Lieb inequalities have recently been proved, and should be compared. We 
have already compared our inequality (52) with G. Harge’s bound, as the same covariance term appears. 
Let us now compare (57) with other inequalities. It seems difficult to obtain a global comparison and we 
are only able to give partial answers or hints. 

• The present paper proposes the two inequalities (52) and (57). In the Gaussian case we have already 
observed that (57)-(58) is stronger than (52). A variant of this argument shows that it is also the 
case for instance when V(x ) = x 2a + /3, x £ 1 with a £ N* and a normalisation constant /?. We 
believe that it is the case for any V since the additional term in (52) vanishes for functions / for 
which the one in (57) does not. 

In fact, for a C 1 function / such f /e~' = 0, the additional term in (57) vanishes if and only if there 
exists a £ 1” such that f = a ■ W (and then a = J f(x)xe~ v ^). For, if / = V/Hess(V) -1 W 
on R", then g(y) = f(VV*(y)) solves g(y) = Vg(y) ■ y on R". Hence for fixed y £ R” the map 
1 1 ->- g(ty)/t is constant; for t = 1 and t —> 0 this implies g(y) = V.g(0) • y. This finally gives /, and 
conversely. But it is classical that these functions / are exactly those for which equality holds in the 
Brascamp-Lieb inequality (4). Hence the additional term in (57) can be seen as a (weighted) way 
of measuring the distance of a function to the optimisers in the Brascamp-Lieb inequality (4). 

Very recently, and under the same hypothesis as in Theorem 4.3, D. Cordero-Erausquin in [17, 
Prop. 6 ] proved that 

Var^f) < J V/ ■ Hess(V ) -1 V/ dg — c\(g) J Hess(V) _ 1 (Hess(V) + cA(/u)Id„) - 1 V/o • \7 fodg (59) 

for all / satisfying f fdg = 0 ; here fo = f — J yf(y)dg(y) ■ W, c is a numerical constant and 
X(g) is the Poincare constant of the measure g. The additional term in (59) vanishes if and only 
if fo is a constant, so also appears here as a distance to the optimisers. A quantitative comparison 
between (57) and (59) can not easily be performed as in particular a numerical constant appears 
in (59). After the present work was completed, M. Arnaudon, M. Bonnefont and A. Joulin [2] have 
derived Brascamp-Lieb inequalities in which the energy has been modified, instead of keeping the 
original energy and allowing for a remainder term, as here. We could not compare their results with 
ours. 

• We now turn to the Gaussian case when g = 7 . We have already observed that (57) is stronger 
that (52), which is exactly (7). On the other hand, ( 8 ) is a purely spectral inequality. We have 
numerically checked that (57) implies ( 8 ) for the Hermite polynomial functions Hk, k £ {1, • • • , 7}. 
We believe that it is the case for all functions, but we do not have a proof of it. 

Let us conclude by mentioning the inequality 

Var-f(f) <6 J |V/| 2 d 7 - 6 J d'y. 

has been proved in [11, Sect. 2]. Their extremal functions have been lost since there is no equality 
when f{x) = a ■ x and the constant in front of the energy is larger than in our bounds. 


A Proof of Theorem 1.1 


Optimality of inequality (15). When W is strictly convex and satisfies (HI), then (14) holds, so that 


W*(VW) 

W n+1 


dx = 


VW ■ x—W 
W n+1 


dx = — 


- [ V (W~ n ) ■ xdx — 1 = 0. 

n J 


In the last equality, we used an integration by parts, valid from hypothesis (HI) satisfied by W. This 
gives the equality case in (15) when g = W. 

Proof of inequality (15). Globally, the proof follows [10], but for completeness we give its main points. 
It is based on a Taylor expansion of the inequality f H t dx > 1, when t = 1 — s goes to 1, and where H t 
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is defined in (16). Equivalently, this inequality can be written as 


/ ( 9 (f ‘ l h ) + l w{h) }) dz - L 

Changing variables in the integral by letting x = z/t, and letting u = s/t, the inequality becomes 
J (p~ n dx > 1 for any u > 0 , where for positive u 

<p u {x) = inf {g(x — uh)+uW(h)}. 

/iGlR n 


Since f g n dx = 1, this is 


Vu 


-dx > 0 


for any u > 0. The main goal is now to consider the limit as u —> 0, by computing the limit 


lim 

u —>-0 


/ 




-dx. 


(60) 


Lemma A.l For any x € R n , 


lim Fu(x)-g{x) 
ti-> 0 + u 


—W*(S7g{xj). 


Proof 

< For any x £ R", from the definition of <p u , we have for any h £ R”, 

~ 9(x) < 9{x ~ uh) - 9(I) + W(k) = -V 9 (x) • h + W(h) + o(»). 


It follows that limsup„_ >0 + < —’S7g{x) ■ h + W{h) for any h, and then by taking the infknum 

over feel", 

<Pu(x) - g{x) 


lim sup ■ 

u—^ 0+ 


< -W*(yg(x)). 


Now, one can observe that 


<Pu(x) 


. < M x ~ uh ) +uW(h)} : 

h, uW(h)<g(x) 


so that 
g(x) - <Pu(x) 


= sup 

h, uW(h)<g(x ) 


g(x) - g(x - uh ) 


— W(h)f< sup {S7g(x) ■ h + \h\e(u\h\) — W(h)}, 

h, uW(h)<g(x) 

(61) 


where e is an appropriate function satisfying lim„_>o s(u) = 0 . 

Let now r = sup{u|/i|, uW(h) < g{x)}. From the hypothesis (HI), 


r < 


sup ^u\h\- < 5 (x)| < D\Jug(x), 


(62) 


where D is a constant. Generally, D denotes a constant and can change from line to line. The bound (62) 
gives 

g{x) - <p u {x) 


< 


sup $Vg(x) ■ h+\h\e(Dy/ug(x)) — W(h)\. 


h , uW(h)<g(x) 


Let now 77 > 0. Then there exists uq > 0 such that Vu £ (0, wo], £ ^Dy/ug(xfj < 77 , so that 

^ u ^ < sup {Vg(x) ■ h + \h\r/— W(h)} < sup {Vg(x) ■ h + \h\g — W(h)}. 

H h, uW(h)<g(x) h€M n 
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By (HI) the supremum is reached, say on a ball of center 0 and radius R > 0 independent of g < 1. Hence 


^< sup {Wg{x) ■ h - W{h)} + Rg = W*{S7g{x)) + Rg. 
u he i" 

The result follows by taking the superior limit and then letting g go to 0. O 


To compute the limit (60), we use the dominated convergence theorem. Since everywhere ip ' u 9 — goes 

to n w n+i 9 ^ when u —> 0, we only need to give a uniform bound (in u) of the quantity of ———. 

For any 0 < a < 6, the following holds | a~ n — b~ n \ < n\a — b\a~ 1 ^ n . Since 0 < tp u (x ) < g(x) by definition 
of (fi u , we can apply this inequality to a = g> u {x) and b = g(x), obtaining 


Vu(x) n -g(x) " 


Vu{x) - g(x) 

U 


U 


Bound on | ip u (x) — g(x)\/u: 

First, from the equality in (61) and a Taylor expansion, 

^< sup {T>g(x,u\h\)\h\ - W(h)}, 

^ h, uW{h)<g(x ) 

where Vg(x,s ) = sup^.^^g |V.g(j/)|. We assume now that u e]0,1]. Then, from (62), r < D^/g(x). 
Hence, by (HI), 

- <Pu ^ < sup {Vg(x,Dy/g(xj)\h\-W(h)\ < sup lvg(x,Dy/g(x)}\h\-^^\. 

u h, uW(h)<g(x)'~ \ / J ftgl" IV ' ^ J 

The explicit computation of the infimum gives 

< DVs ( x , Dv ^y. 

Then, from the hypothesis (H2), the estimation of Vg gives the bound 

Q < 9 ^ ~ < D{ \x\ 2 + 1), «e]0,l],iGR n . 


Bound on <p u (x): 

From the hypotheses (HI) and (H2) we have 

<Pu(x) 2 + U 2C r } + ^' 

When u G]0,1], the explicit computation of the infimum gives again 

<Pu{x) > D(\x\ 2 + 1). 

Finally, we have obtained the upper bound 

< D(\x\ 2 + I)"”, «e]0,l],ieP. 

The dominated convergence theorem can then be applied. The proof of Theorem 1.1 is then complete. 


<Pu(x)~ n — g{x)~ n 

U 
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B Proof of Theorem 4.3 


We adapt the argument of [9]. 

We will assume throughout the proof that V is C 3 with bounded derivatives V 2 F and V 3 F, and that 
there exists p > 0 such that uniformly in K™, Hess(V) > pld„. Then the result extends to V as in the 
Theorem by approximation. 

Let / be a C 1 compactly supported function satisfying f fdp, = 0. We apply the Borell-Brascamp-Lieb 
inequality (13) for t = s = 1/2, F = exp(— V), G = exp(2<5/ — V)/Zg (S > 0) where Zg = f exp(26f)dp, 
and finally H = exp (<fig — V) where 

4>s(z) = —nlog inf ( Z] /n exp f-—f(z + h) + + -(- exp j+nlog(2) +V(z). (63) 

/i€R n ( \ n n ) \ n /J 

Then (13) ensures that J e^ s dp > 1. The rest of the proof is devoted to a Taylor expansion of f exp (cj)g)dp 
as S goes to 0. 

By convexity of V, for any S > 0 the function in (63) to be minimised is coercive, so indeed admits a 
(possibly non unique) minimiser, which we first estimate by giving a Taylor expansion as 6 —> 0. 

For this, let S > 0 be given and let hg be any minimiser. Then 

Z\ jn [^- 2 5X7 f(z + hg) + X7V(z + hg )) exp ^-^-f(z + h s ) + ^F(z + hg) 

= X7V(z — hg) exp V{z — hg)^ . (64) 

1. In a first step we prove that hg = 0(6) uniformly in z: in other words, there exists a constant C > 0 
such that for any 5 small enough and any z £ R", 

\hg\<C6. 

For this, first, since f fdp = 0 and / is compactly supported, 



Let us now assume that the support of / is included in the ball {|ir| < R} with R > 0. There are two 
cases, depending on whether \z + hg\ > R or |z + hg\ < R. 


• First, assume that | z + hg\ > R. Then equation (64) becomes 

Zy ,l X7V(z + hg) exp (J^V( Z + hg)^j = X7V(z - hg) exp ^V(2 - hg)^j . 

In other words, by (65) and taking the scalar product by hg, 

(^- J f 2 dp + o(6 2 )j V$(z + hg) ■ hg = X7$(z - hg) ■ hg - V$(~ + hg) ■ hg 
where $ = exp(iF), that is, 

— (~—J f 2 dh + °(^ 2 )) W(z + hg) ■ hg = e _l G+ h s) j hg ■ Hess($)(z + thg)hgdt. 


Now Hess(F) > pld n so 

e v i e v i 

Hess($) = —(Hess(F) + -W <g> VF) > — (p Id„ + -VF® W). 
n n n n 


( 66 ) 
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Hence 


-( 2 S 2 J f 2 d[x + o(S 2 )^V(z + hs)-h s >e- v ( z+h ^ j (p\hg\ 2 + ^-\VV{z + thg) ■ h s \ 2 y nz+ths) dt. 

(67) 

In particular, W {z + hg) ■ hg < 0 on the left-hand side for S small enough, independently of z since 
the o(S 2 ) comes from Zg, see (65), and is uniform in z; hence for any t £ [—1,1] 

V(z + thg) — V{z + hg) > (t — 1) VV(z + hg) ■ hg > 0 

by convexity of V. Moreover W (z + thg ) • hg < X7V(z + hg) ■ hg again by convexity, whence 

|VV (z + thg ) • hg\ > | VV(z + hg) ■ h 5 \. 

Collecting all terms, (67) leads to 

(2 S 2 J f 2 dp + o(S 2 )'j \S7V (z + hg) ■ hg\ > 2p\hg\ 2 + ^-\W(z + hg) ■ hg\ 2 > A s J?-\hg\ \VV(z + hg) ■ hg\. 

for S small enough, and where the o(S 2 ) is uniform in 2 . Hence there exists a constant A > 0 such that 

\h s \ <A6 2 , 

for any 6 small enough and any z, whenever \z + hg\ > R. 

Assume now that \z + hg\ < R. Let us write equation (64) as 


V$(z +hg)— V$(z — hg) = 1 - z]/ n exp ( — — f(z + hg) ) \7<S>(z + hg) 


26 

n 


+ 26Z x J n exp (- — f{z + hg) + -V(z + hg)]vf(z + hg). 
\ n n J 


Then /, V and their gradients are continuous and then uniformly bounded on the ball { |ar| < R}, 
so by (65) there exists a constant A such that for all <5 small enough and all z with \z + hg\ < R 


V4>(z + hg) — V4>(2! — hg) 


< AS. 


Hence, by the Cauchy-Schwarz inequality and the bound Hess($) > Id„, a consequence of (66), 


fi 

A5\hg\ > ^V< 1 )( 2 ; + hg) — V4>(2 — hg)^j ■ hg = J hg ■ Hess(4>)(2; + thg)hg 


dt > ^R e min V |^|2 
n 


By uniform convexity the function V is indeed bounded from below on K™, so there exists a constant 
B such that for all <5 small enough and all z with \z + hg \ < R 


\hg\ < B6. 


All cases being covered, our first step is completed. 

2. In a second step we perform a first-order Taylor expansion of the equality (64). For z fixed, it gives 

- 5Vf(z) + Hess {V){z)hg - -f{z)S7V{z) + h& ' VV{z) W(g) + o z (S) = 0, (68) 

n n 

where o z (S ) depends on z, S and hg. Since \hg\ < C <5 by the first step, uniformly in z, one deduces 
from (68) that 

M<5)| < AS 2 (|Hess(F)(z)| + \VV(z)\ 2 + 1) 
for a constant A and for any z. 
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In the sequel we let H(z) denote positive polynomial functions in V(z), XV (z), etc., independent of 6 
small and which can change from line to line. The latter inequality can then be written as 

M<5)| < S 2 H(z). (69) 

Let now X = X f • Hess(F) -1 W and Y = XV ■ Hess(F) -1 W. Taking the scalar product of (68) with 
Hess(F) _1 VF one gets 

hs-XV = S X + ^ + o z (S) 

1 + n 

at the point 2 , where o z (S) satisfies (69) since in particular Hess(F) -1 < p _1 Id„. Then, again by (68), 


hs 


= 8 


Hess (F) _1 V/ 


Hess(V')~ 1 VV' f-X 
n 1 + — 

n 


+ o z (S) 


where again o z (5) satisfies (69). 

We now compute the second-order Taylor expansion of the function <t>$. First, from the expansion e x = 
1 + x + x 2 /2 + x 3 e 6x /6 with 9 £ (0,1), we have at the point 2 , 


(j) S = -nlog(l +if) S ) 


with 


S £ Shs-Xf , h s ■ Ress(V)h s , S 2 j2 , (h s -XVf Sf S 2 , 2 _ 2 

Vs = -/-1-o-h —/ H-- 2 hs ' vv + ~ J d/j, + o z {5 . 

n n 2n n z ZiV n z n 


n 2n n 2 J 2 n 2 n* 

Here o z (S 2 ) now satisfies 

|b z (6 2 )| < 8 3 H(z)ex.p (5 3 K 3 (z)) (70) 

with |A' 3 (z)| < H(|VF| + |V 2 H| + |V 3 F|) for an universal constant A. 

We now observe that for small 8 one has <j>$(z ) < nlog2 for all z, that is, ips(z) > —1/2. Indeed, for small 
8 one has 


Z ^ exp ^=Lf {x) )>l 

uniformly in x £ R n , by (65) and since / is bounded from above. Hence for any h £ 
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Z ¥ n ex P + h ) + 

'V(z + h) 


V(z + h ) 


>2 exp 


+ exp 


+ exp 
V(z - h) 


V(z - h) 


n 


n 

1 

> - 
“ 2 L 


exp 


V(z + h) 


+ exp 


V(z — h) 


> e 


V(z)/r, 


by convexity of e v ^ n . The bound on <j>$ follows by its definition (63). 

Now from the expansion (1 + x)~ n = 1 — nx + n(n + l)x 2 /2 — n(n + l)(n + 2):r 3 (l + 6x )~ n ~ 3 /6 with 
9 £ (0,1) and (68), we get 


(l + iP s )~ n = l+8f + 


hs ■ Hess (V)hg 


8 2 - —l/ 2 . 

2 n 


(h s ■ XV) 2 
2 n 


-8 2 f 2 dn + o z (S 2 ) 


for a o z (5 2 ) satisfying (70): here we use that ips(z) > —1/2 so that 1 + 9ipg > 1/2 in the Taylor expansion, 
uniformly in z and <5. The above expressions of hs and hs ■ XV finally give 

(1 + ^r n = 1 + Sf + y V/ • Hess (Vy'Xf - y (/ n ~ y )2 + y f 2 -8 2 J f 2 dn + o z (S 2 ). 


In conclusion, by integration the second-order Taylor expansion of the Borell-Brascamp-Lieb inequality 
f (1 + ips)~ n dn = f e^ s dn > 1 implies 

J fdn< J X f ■ Hess(H)” 1 V/d/i — J {f ~ X ^ dy. 

for all C 1 compactly supported / such that J fd/i = 0. Here we use that 8~ 2 J d z (S 2 )e~ l '^dz —> 0 as 
8 —> 0 by (70), since the right-hand side in (70) is in L 1 (e _ ' / ) by our hypotheses on V. By definition of 
X and Y this concludes the argument. 
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C Link with G. Harge’s bound (53) 

In this Appendix, we observe that G. Harge’s bound (53) can be formally recovered by linearization in the 

Monge-Ampere equation (28). Let indeed / be a smooth function such that J f = 0, and ji 2 = (1+e /)/i 

for e > 0, and expand the transport map V<p(x) sending /j,i = /j, onto ^ 2 as x + eS79i {x)+£ 2 S76 2 {x) + o{e 2 ). 
Taking logarithms in (28) with such /ri and /r 2 and observing that 

2 

logdet(Hess(v?)) = logdet (V+£Hess(0i)+£ 2 Hess(0 2 )-l-o(£ 2 )^ = eA8i+e 2 A9 2 — ^-tr[(Hess(0i)) 2 ] +o(e 2 ), 
a second-order Taylor expansion ensures that / = — L9\ in the first-order terms; moreover 
f 2 = -V6»i • Hess(F)V0i + 2 L8 2 + 2V/ • V0i - tr[(Hess(0i)) 2 ] 
in the second-order terms. Assume now that Hess(V) > 0, and let M = Hess(F) 1 / 2 > 0. Then 
-V0i • Hess(T/)V0i + 2V/ • V0i = |M _1 V/| 2 - |MV0i - M _1 V/1 2 


so that 


J f 2 d^ = J V/-Hess(V) _1 V/dju-y (|MV0i — M _1 V/| 2 + tr[(Hess(0i)) 2 ]^ dfi (71) 

by integration. At this point one recognizes terms in the proof of [31, Th. 1] : one observes that / = —L9\ 
so V/ = M 2 9\ — X by differentiation, where X £ R" is the vector with coordinates L(di8 1 ); hence 

|MV0i - M -1 V/| 2 = \M~ 1 X\ 2 > i|X| 2 

if moreover Hess(Id) < S. In particular 

J \MX8 1 -M- 1 Xf\ 2 d^> | (/Wi)) 2 ^ | (dpt) 2 dfj, 

i i,j 

by (55), if Hess(H) > i?Id n . Hence 

j (|MV0 1 -M- 1 V/ l | 2 +tr[(Hess(0 1 )) 2 ])d Al > (l + |) ^ (l + |) (/A0, d^j (72) 

since moreover by the Cauchy-Schwarz inequality 

2 / \ 2 2 

(J A0i cfyx) = i'^2 J d ii 9 1 dfij <%i0id/z) <«5Z(y dijdidu 

^ 'Vi / i ' ' i,j ' 

By (71) and (72) we finally recover (53) since by integration by parts and (55) 

J A0i d/i = y V0i • VH e _l dx = — J L9i V e -v dx = J fV d/j. 
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